Greek Character Encoding for Electronic Mail Messages

Spinellis                    Informational                    May 1996


Status of This Memo


This memo provides information for the Internet community. This memo 
does not specify an Internet standard of any kind. Distribution of 
this memo is unlimited. 


Overview and Rationale


This document describes a standard encoding for electronic mail
[RFC822] containing Greek text and provides implementation
guidelines. The standard is based on MIME [RFC1521] and the ISO 8859-7
character encoding. Although the implementation of this standard is
straightforward, several non-standard but "functional" alternatives
are in common use, even though they are unlikely to interoperate. For
this reason we highlight common implementation and mail user agent
setup errors.

Description 


In order to transfer Greek text via electronic mail, the text is first
translated into the ISO 8859-7 character set and then encoded using
either the Base64 (preferable for text that is mainly Greek) or the
Quoted-Printable (justifiable in cases where some Greek words appear
inside predominantly Latin text) method, as defined in MIME.
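
The two steps described above can be sketched in Python as follows.
This is only an illustration under stated assumptions: the function
name encode_greek_body and its mostly_greek flag are hypothetical, and
the "iso8859_7" codec and the base64 and quopri modules of the Python
standard library stand in for whatever a real mail user agent uses.

```python
import base64
import quopri


def encode_greek_body(text, mostly_greek=True):
    """Return (transfer_encoding_name, encoded_bytes) for a text body.

    Hypothetical helper: the name and the mostly_greek flag are
    illustrative and not part of this memo.
    """
    # Step 1: translate the text into the ISO 8859-7 character set.
    raw = text.encode("iso8859_7")
    # Step 2: apply one of the two MIME content transfer encodings.
    if mostly_greek:
        return "base64", base64.encodebytes(raw)
    return "quoted-printable", quopri.encodestring(raw)


if __name__ == "__main__":
    print(encode_greek_body("Καλημέρα"))
    print(encode_greek_body("The word λόγος is Greek", mostly_greek=False))
```

In the Quoted-Printable case the Latin words stay readable in the
encoded body, which is the reason it is preferred for text that is
only partly Greek.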


The following table lists the most common Greek encodings (see also
[RFC1345]):


0646 37 M7 51 MC 23 69 LG L1 G7 GO GC 28 97 Description

0386 ea a2 86 cd 71 86                   b6 Capital alpha with acute
0388 eb b8 8d ce 72 8d                   b8 Capital epsilon with acute
0389 ec b9 8f d7 73 8f                   b9 Capital eta with acute
038a ed ba 90 d8 75 90                   ba Capital iota with acute
038c ee bc 92 d9 76 92                   bc Capital omicron with acute
038e ef be 95 da 77 95                   be Capital upsilon with acute
038f f0 bf 98 df 78 98                   bf Capital omega with acute
0390    c0 a1 fd    a1                   c0 Small iota with acute and
                                            diaeresis
0391 80 c1 a4 b0 41 a4 61    41 61 41 41 c1 Capital alpha
0392 81 c2 a5 b5 42 a5 62    42 62 42 42 c2 Capital beta
0393 82 c3 a6 a1 43 a6 67 23 43 67 43 44 c3 Capital gamma
0394 83 c4 a7 a2 44 a7 64 40 44 64 44 45 c4 Capital delta
0395 84 c5 a8 b6 45 a8 65    45 65 45 46 c5 Capital epsilon
0396 85 c6 a9 b7 46 a9 7a    46 7a 46 49 c6 Capital zeta
0397 86 c7 aa b8 47 aa 68    47 68 47 4a c7 Capital eta
0398 87 c8 ac a3 48 ac 75 5c 48 75 48 4b c8 Capital theta
0399 88 c9 ad b9 49 ad 69    49 69 49 4c c9 Capital iota
039a 89 ca b5 ba 51 b5 6b    4b 6b 4a 4d ca Capital kappa
039b 8a cb b6 a4 52 b6 6c 5e 4c 6c 4b 4e cb Capital lamda
039c 8b cc b7 bb 53 b7 6d    4d 6d 4c 4f cc Capital mu
039d 8c cd b8 c1 54 b8 6e    4e 6e 4d 50 cd Capital nu
039e 8d ce bd a5 55 bd 6a 21 4f 6a 4e 51 ce Capital xi
039f 8e cf be c3 56 be 6f    50 6f 4f 52 cf Capital omicron
03a0 8f d0 c6 a6 57 c6 70 3f 51 70 50 53 d0 Capital pi
03a1 90 d1 c7 c4 58 c7 72    52 72 51 55 d1 Capital rho
03a3 91 d3 cf aa 59 cf 73 5f 53 73 53 56 d3 Capital sigma
03a4 92 d4 d0 c6 62 d0 74    54 74 54 58 d4 Capital tau
03a5 93 d5 d1 cb 63 d1 79    55 79 55 59 d5 Capital upsilon
03a6 94 d6 d2 bc 64 d2 66 5d 56 66 56 5a d6 Capital phi
03a7 95 d7 d3 cc 65 d3 78    58 78 57 5b d7 Capital chi
03a8 96 d8 d4 be 66 d4 63 3a 59 63 58 5c d8 Capital psi
03a9 97 d9 d5 bf 67 d5 76 5b 5a 76 59 5d d9 Capital omega
03aa    da    ab    91                   da Capital iota with diaeresis
03ab    db    bd    96                   db Capital upsilon with
                                            diaeresis
03ac e1 dc 9b c0 b1 9b                   dc Small alpha with acute
03ad e2 dd 9d db b2 9d                   dd Small epsilon with acute
03ae e3 de 9e dc b3 9e                   de Small eta with acute
03af e5 df 9f dd b5 9f                   df Small iota with acute
03b0    e0 fc fe    fc                   e0 Small upsilon with acute
                                            and diaeresis
03b1 98 e1 d6 e1 8a d6       61 41 61 61 e1 Small alpha
03b2 99 e2 d7 e2 8b d7       62 42 62 62 e2 Small beta
03b3 9a e3 d8 e7 8c d8       63 47 63 64 e3 Small gamma
03b4 9b e4 dd e4 8d dd       64 44 64 65 e4 Small delta
03b5 9c e5 de e5 8e de       65 45 65 66 e5 Small epsilon
03b6 9d e6 e0 fa 8f e0       66 5a 66 69 e6 Small zeta
03b7 9e e7 e1 e8 9a e1       67 48 67 6a e7 Small eta
03b8 9f e8 e2 f5 9b e2       68 55 68 6b e8 Small theta
03b9 a0 e9 e3 e9 9c e3       69 49 69 6c e9 Small iota
03ba a1 ea e4 eb 9d e4       6b 4b 6a 6d ea Small kappa
03bb a2 eb e5 ec 9e e5       6c 4c 6b 6e eb Small lamda
03bc a3 ec e6 ed 9f e6       6d 4d 6c 6f ec Small mu
03bd a4 ed e7 ee aa e7       6e 4e 6d 70 ed Small nu
03be a5 ee e8 ea ab e8       6f 4a 6e 71 ee Small xi
03bf a6 ef e9 ef ac e9       70 4f 6f 72 ef Small omicron
03c0 a7 f0 ea f0 ad ea       71 50 70 73 f0 Small pi
03c1 a8 f1 eb f2 ae eb       72 52 71 75 f1 Small rho
03c2 aa f2 ed f7 af ed       77 57 72 77 f2 Small final sigma
03c3 a9 f3 ec f3 ba ec       73 53 73 76 f3 Small sigma
03c4 ab f4 ee f4 bb ee       74 54 74 78 f4 Small tau
03c5 ac f5 f2 f9 bc f2       75 59 75 79 f5 Small upsilon
03c6 ad f6 f3 e6 bd f3       76 46 76 7a f6 Small phi
03c7 ae f7 f4 f8 be f4       78 58 77 7b f7 Small chi
03c8 af f8 f6 e3 bf f6       79 43 78 7c f8 Small psi
03c9 e0 f9 fa f6 db fa       7a 56 79 7d f9 Small omega
03ca e4 fa a0 fb b4 a0                   fa Small iota with diaeresis
03cb e8 fb fb fc b8 fb                   fb Small upsilon with diaeresis
03cc e6 fc a2 de b6 a2                   fc Small omicron with acute
03cd e7 fd a3 e0 b7 a3                   fd Small upsilon with acute
03ce e9 fe fd f1 b9 fd                   fe Small omega with acute

Note: All values are in hexadecimal.

The column headers refer to the following character sets:

0646  The ISO 2DIS 10646 code.

37    PC code page 737 also known as 437G. Note that some
      implementations of this code page do not include capital
      letters with acute.

M7    Character set 8859-7 as implemented in Microsoft Windows 3.1,
      Microsoft Windows 3.11, and Microsoft Windows 95.

51    IBM code page 851.

MC    The Greek code page implemented on the Apple Macintosh computers.

23    IBM code page 423 (EBCDIC-CP-GR).

69    IBM code page 869.

LG    Latin Greek (iso-ir-19).

L1    Latin Greek 1 (iso-ir-27). This page only contains the Greek
      capital letters whose glyphs do not exist in the Latin alphabet.
      The other capital letters are rendered using the equivalent
      Latin letter (e.g. "Greek capital letter alpha" is rendered as
      "Latin capital letter A"). When mapping "Latin Greek 1" text to
      ISO 8859-7 the Latin capital letters should only be transcribed
      to the equivalent Greek ones if a suitable heuristic determines
      that the specific Latin letters are used to represent Greek
      glyphs.

G7    7 bit Greek (iso-ir-88).

GO    Old 7 bit Greek (iso-ir-18).

GC    Greek CCITT (iso-ir-150).

28    Character set ISO 5428:1980 (iso-ir-55).

97    The target character set ISO 8859-7:1987 (ELOT-928) (iso-ir-126).
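
Some columns of the table can be cross-checked against codecs that
ship with Python. The sketch below is only an aid for such checks: the
pairing of column labels with Python codec names (for example "M7" with
"cp1253") is an assumption made for illustration, and the helper name
table_row is hypothetical.

```python
# Assumed pairing of table column labels with Python codec names.
COLUMN_CODECS = {
    "37": "cp737",
    "M7": "cp1253",
    "MC": "mac_greek",
    "69": "cp869",
    "97": "iso8859_7",
}


def table_row(char):
    """Return the hexadecimal code of one character in each codec.

    Columns whose codec cannot represent the character are left empty.
    """
    row = {"0646": "%04x" % ord(char)}
    for column, codec in COLUMN_CODECS.items():
        try:
            row[column] = char.encode(codec).hex()
        except UnicodeEncodeError:
            row[column] = ""
    return row


if __name__ == "__main__":
    # U+0386 is "Greek capital letter alpha with acute".
    print(table_row("\u0386"))
```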


MIME Headers 


A mail message that contains Greek text must contain at least the 
following MIME headers: 


MIME-Version: 1.0 
Content-type: text/plain; charset=ISO-8859-7 
Content-transfer-encoding: BASE64 | Quoted-Printable 
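
As an illustration, a message carrying these headers could be produced
with the email package of the Python standard library. This is a
sketch, not a reference implementation: the function name
build_greek_message and the sample subject are hypothetical, and the
library writes the header names and the charset value in its own
letter case.

```python
from email.message import EmailMessage


def build_greek_message(text, cte="base64"):
    """Build a text/plain message labelled as ISO 8859-7.

    cte is expected to be "base64" or "quoted-printable".
    """
    msg = EmailMessage()
    msg["Subject"] = "Sample Greek mail"
    # set_content() encodes the body and adds the Content-Type,
    # Content-Transfer-Encoding and MIME-Version headers.
    msg.set_content(text, charset="iso-8859-7", cte=cte)
    return msg


if __name__ == "__main__":
    print(build_greek_message("Καλημέρα\n").as_string())
```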


In the future, when all email systems implement fully transparent
8-bit e-mail as defined in RFC 1425 and RFC 1426, the message body
encoding phase described in this standard will no longer be
needed. In this case the requisite MIME headers are modified as
follows:


MIME-Version: 1.0
Content-type: text/plain; charset=ISO-8859-7
Content-transfer-encoding: 8BIT


Even when RFC 1425 is used, Q or B encoding will continue to apply
to message headers as detailed in the following section.


Optional 


It is recommended, although not required, to support Greek encoding
in mail headers as specified in RFC 1522. Specifically, the
B-encoding format is to be the default method used for encoding
Greek text in RFC-822 mail headers, and the Q-encoding format the
method to use for the exceptional case of encoding a single Greek
word or letter in an otherwise Latin-character-based header.
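
The B and Q forms of a header word can be sketched as follows. The
helpers b_encode, q_encode and decode_word are hypothetical names used
only for illustration. The sketch ignores the length limit that the
header encoding rules place on each encoded word, so it is suitable
for short words only.

```python
import base64
from email.header import decode_header

CHARSET = "ISO-8859-7"


def b_encode(text):
    """Encode text as a single B-encoded word (Base64 based)."""
    raw = text.encode("iso8859_7")
    return "=?%s?B?%s?=" % (CHARSET, base64.b64encode(raw).decode("ascii"))


def q_encode(text):
    """Encode text as a single Q-encoded word (Quoted-Printable based)."""
    out = []
    for byte in text.encode("iso8859_7"):
        ch = chr(byte)
        if byte < 128 and ch.isalnum():
            out.append(ch)             # ASCII letters and digits pass through
        elif ch == " ":
            out.append("_")            # a space is written as an underscore
        else:
            out.append("=%02X" % byte)
    return "=?%s?Q?%s?=" % (CHARSET, "".join(out))


def decode_word(word):
    """Decode a single encoded word back to text."""
    raw, charset = decode_header(word)[0]
    return raw.decode(charset)


if __name__ == "__main__":
    print(b_encode("Δείγμα"))
    print(q_encode("α"))
```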


Example


Below is a short example of Base64 encoded Greek email:


Date: Wed, 31 Jan 96 20:15:03 EET
From: Diomidis Spinellis <dds@senanet.com>
Subject: Sample Greek mail
To: Achilleas Voliotis <achilles@theseas.ntua.gr>
MIME-Version: 1.0
Content-ID: <Wed_Feb_14_18_49_50_EET_1996_0@senanet>
Content-Type: Text/plain; charset=ISO-8859-7
Content-Transfer-Encoding: Base64

yuHr5+zd8eEsCgrU7yDl6+vn7enq/CDh6/bc4uf07yDh8O/05evl3/Th6SDh8PwgMjYg4/Hc
7Ozh9OEuCg==
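
A receiving agent reverses the two steps: it first undoes the Base64
transfer encoding and then interprets the resulting octets as ISO
8859-7. The sketch below assumes Python and its "iso8859_7" codec. The
BODY string is the example body with a few characters that appear to
have been mis-transcribed (such as the digit 1 for the letter l)
restored so that it decodes to Greek text. That restoration is an
assumption of this sketch.

```python
import base64

# Example body, with likely transcription errors restored (assumption).
BODY = (
    "yuHr5+zd8eEsCgrU7yDl6+vn7enq/CDh6/bc4uf07yDh8O/05evl3/Th6SDh8PwgMjYg4/Hc\n"
    "7Ozh9OEuCg==\n"
)


def decode_body(body, charset="iso8859_7"):
    """Undo the Base64 transfer encoding, then decode the character set."""
    return base64.b64decode(body).decode(charset)


if __name__ == "__main__":
    print(decode_body(BODY))
```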


Discussion 


It is possible [RFC1428] (and unfortunately common practice) to
set up an arrangement of mail user and transfer agents that allows
end users to communicate with Greek e-mail messages while
violating a number of standards. Such arrangements are unlikely
to offer wide-scale interoperability.


One common error is to arrange the rendering and composition of 
Greek messages by rigging a mail user agent hosted in an ISO 
8859-1 environment to use a presentation font that contains Greek 
glyphs and a keyboard input method that generates Greek text using 
those glyphs. The resulting messages begin with header items 
indicating contents in the ISO 8859-1 character set and include 
text in a totally different encoding. Unfortunately this 
"solution" appears to "work" across similar systems and is widely 
used. 
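
The effect of this error on a conforming recipient can be reproduced
in a few lines. The sketch below uses Python codecs to show what a
reader sees when ISO 8859-7 octets are labelled as ISO 8859-1, and how
the text can be recovered once the real encoding is known. The
function names are hypothetical.

```python
def as_seen_by_recipient(text, declared="iso8859_1", actual="iso8859_7"):
    """Render text the way an agent trusting the wrong label would."""
    return text.encode(actual).decode(declared)


def repair(mislabelled, declared="iso8859_1", actual="iso8859_7"):
    """Recover the text when the real encoding is known."""
    return mislabelled.encode(declared).decode(actual)


if __name__ == "__main__":
    garbled = as_seen_by_recipient("Καλημέρα")
    print(garbled)
    print(repair(garbled))
```

The repair only works because every octet has some meaning in ISO
8859-1. The recipient still has to guess the real encoding, which is
why correct labelling matters.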


Another error is to tag Greek text generated on Microsoft
Windows platforms as ISO 8859-7 without an intermediate
translation phase. It is important to note that the character set
used by the Microsoft Windows Greek implementations is NOT the
same as the ISO 8859-7 representation. First of all, the
encoding of some Greek characters differs slightly from the
ISO 8859-7 encoding (this difference was introduced in
order to rectify the appearance of an early version of Microsoft
Word for Windows in which the end-of-section symbol clashed with
the "Greek capital alpha with acute" glyph). In addition, a
number of 8-bit characters available on Greek Windows
implementations are not part of the ISO 8859-7 character set.
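
The intermediate translation phase can be sketched as a conversion
through Unicode. The sketch assumes that Python's "cp1253" codec
describes the Greek Windows character set. The function name is
hypothetical, and a real agent would need its own policy for
characters that have no ISO 8859-7 equivalent.

```python
def windows_to_iso8859_7(data, errors="strict"):
    """Translate Greek Windows octets to ISO 8859-7 octets.

    With errors="strict" a character missing from ISO 8859-7 raises
    UnicodeEncodeError; errors="replace" substitutes "?" instead.
    """
    return data.decode("cp1253").encode("iso8859_7", errors)


if __name__ == "__main__":
    # Capital alpha with acute: a2 on Windows, b6 in ISO 8859-7.
    print(windows_to_iso8859_7(b"\xa2").hex())
    # Octet 85 (horizontal ellipsis in cp1253) has no ISO 8859-7 code.
    print(windows_to_iso8859_7(b"\x85", errors="replace"))
```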


Note that the ISO 8859-7 encoding is equivalent to the Greek
Standards Organisation ELOT-928 encoding.


Security Considerations


Security issues are not discussed in this memo.